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Abstract 



^ The behavior of RPC detectors is highly sensitive to environmental variables. A novel approach is presented to model the behavior 
bf RPC detectors in a variety of experimental conditions. The algorithm, based on Artificial Neural Networks, has been developed 
I— I find tested on the CMS RPC gas gain monitoring system during commissioning. 
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1. Introduction 22 

23 

Resistive Plate Chamber (RPC) detectors [1] are widely ^'^ 
used in HEP experiments for muon detection and trigger- 
ing at high-energy, high-luminosity hadron colliders [2,3],^^ 
in astroparticle physics experiments for the detection of ex- 2'' 
tended air showers [4], as well as in medical and imaging 
applications [5]. At the LHC, the muon system of the CMS 
experiment [6] relies on drift tubes, cathode strip chambers-"' 
andRPCs[7]. 3i 

In this paper a new approach is proposed to model the 
behavior of an RPC detector via a multivariate strategy. 
Full details on the developed algorithm and results can be ^'^ 
found in Ref . [8] . The algorithm, based on Artificial Neu- 
ral Networks (ANN), allows one to predict the behavior 
of RPCs as a function of a set of variables, once enough 
data is available to provide a training to the ANN. At the 
present stage only environmental variables (temperature 
T, atmospheric pressure p and relative humidity H) have*" 
been considered. Further studies including radiation dose^^ 
are underway and will be the subject of a forthcoming pa- '^^ 
per. In a preliminary phase we trained a neural network '^^ 
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with just one variable and we found out, as expected, that 
the predictions are improved after adding more variables 
into the network. The agreement found between data and 
prediction has to be considered a pessimistic evaluation of 
the validity of the algorithm, since it also depends on the 
presence of unknown variables not considered for training. 

The data for this study have been collected utilizing the 
gas gain monitoring (GGM) system [9] [10] [11] of the CMS 
RPC muon detector during the commissioning with cosmic 
rays in the ISR test area at CERN. 

The GGM system is composed by the same type of RPC 
used in the CMS detector (2 mm-thick Bakelite gaps) but 
of smaller size (50x50 cm^). Twelve gaps are arranged in a 
stack. The trigger is provided by four out of twelve gaps of 
the stack, while the remaining eight gaps are used to mon- 
itor the working point by means of a cosmic ray telescope 
based on RPC detectors. 

In this study, the GGM was operated in open loop mode 
with a Freon 95.5%, Isobutane 4.2%, SFg 0.3% gas mixture. 
Six out of eight monitoring gaps were used, two out of eight 
monitoring gaps failed during the study and were therefore 
excluded from the analysis. The monitoring is performed 
by measuring the charge distributions of each chamber. 
The six gaps are operated at different high voltages, fixed 
for each chamber, in order to monitor the total range of 



operating modes of the gaps (Table 1) . The operation mode 84 
of the RPC changes as a function of the voltage applied, in 85 
particular the chamber will change from avalanche mode 86 
to streamer mode when increasing HV. 87 

Tabic 1 

Applied high voltage for power supplies for GGM RPC detectors ^9 
used in this study 
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Applied high voltage (kV) 


10.2 


9.8 
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10.4 


10.2 


10.4 
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2. The Artificial Neural Network simulation code ^3 

94 

An Artificial Neural Network (ANN) is an information^^ 

processing paradigm that is inspired by the way biological'^ 
nervous systems, such as the brain, process information[12] . 
The most common type of artificial neural network (Fig. 1) 
consists of three groups, or layers, of units: a layer of in- ^ 
put units is connected to a layer of hidden units, which 
connected to a layer of output unit. The activity of the in-"' 
put units represents the raw information that is fed intc?"^ 
the network. The activity of each hidden unit is determined'^''^ 

104 

INPUT HIDDEN OUTPUT 



estimates the error and modifies the neurons weights in 
order to minimize the estimated error. 

The training phase consists of determining both weights 
and configuration (nunber of neurons and number of layers) 
by minimizing the error, i.e., the difference between data 
and output. 

3. Environmentcd variables and datasets 

The environmental variables are monitored by an Oregon 
Scientific weather station WMRIOO. The DAQ has been 
modified in order to acquire via USB the environmental 
informations and merge environmental variables with out- 
put variables. The acciiracy of the temperature sensor is 
±rC in the range - 40°C and the resolution is 0.1°C. 
The relative humidity sensor has an operating range from 
2% to 98% with a 1% resolution, ±7% absolute accuracy 
from 25% to 40%, and ±5% from 40% to 80%. The barom- 
eter operational range is between 700 mbar and 1050 mbar 
with a 1 mbar resolution and a ±10 mbar accuracy. 

The online monitoring system records the ambient tem- 
perature, pressure and humidity of the GGM box that con- 
tains the RPC stack. Pressure and temperature are mainly 
responsible of different detector behavior as well as the hu- 
midity for the bakelite and gas properties. 

The used dataset is composed of four periods, each period 
composed of runs (about 270 each). Each run contains 10^ 
cosmic ray events where environmental variables and GGM 
anodic output charges {Q) are collected. The acquisition 
rate is typically 9.5 Hz. 



112 

Fig. 1. Example of a simple Neural Network configuration. 

by the activities of the input units and the weights on th^is 
connections between the inputs and the hidden units. Th^i'* 
behavior of the output units depends on the activity of thesis 
hidden units and the weights between the hidden and out-^i^ 
put units. For this study temperature, humidity and pres-^i^ 
sure have been selected as inputs and anodic charge as out-^s 
put variable. It was demonstrated[13] that the number ofis 
layers is not critical for the network performance, so we de-^20 
cidcd to go with 3 layers and give to the neural network a^2i 
sufficient number of hidden units automatically optimizec|22 
by a genetic algorithm that can take into account severaP^ 
configurations. 

For each configuration a genetic algorithm performs th^24 
training process with an estimation of the global error; then 
the configuration is stored and the genetic algorithm con-125 
tinues to evaluate a slightly different configuration. Onca26 
the algorithm has taken into account all the possible con-i27 
figurations the best one in terms of global error is chosen. 128 

During the training phase the network is taught with29 
environmental data as input, the output depends on thaso 
neuronal weights, that at the very beginning are initializedai 
with random numbers. The network output is compared ta32 
the experimental data we want to model, then the networkas 



4. Results 

Typical ANN outputs show generally good agreement 
between data and prediction during training phase. (Fig. 2 
(a)). In periods where the prediction is not accurate, the 
discrepancy is typically concentrated in narrow regions 
("spikes" ). Fig 2 (6) shows the prediction on period 3 using 
the period 1 as training, the discrepancy around run 137 
and run 256 are due to a set of environmental variables 
not available in the training period as shown in Fig. 2 (c) 
and Fig. 3. 

The comparison between data and prediction is shown 
in Fig. 4 where the quantity 

AQ _ Qexp — Qann 
Q Qexp 

is plotted for all four periods both for training (top) and 
predictions (bottom), divided for training and prediction 
respectively. The error distribution for the predictions is 
much wider than for the training, as expected. 

The gaussian fit superimposed (Fig. 4) is not able to fit 
the data properly due to the presence of large nongaussian 
tails, which are caused by runs with very large discrepancy 
between data and prediction. To evaluate the width a of 
the error distribution we perform a gaussian fit in a reduced 
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Fig. 2. (a) Gap 7 trained on the period 3 - prediction on period 3; the 
prediction is performed on the same period used as training with very 
good agreement between experimental data and prediction, (b) Gap 
7 trained on the period 1 - prediction on period 3, the prediction is 
performed on a period different from the training one, the agreement 
depends on dispersion of environmental variables, (c) Environmental 
variables during the period 3. 
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Fig. 3. Environmental variables during the period 1 



range which does not take into account the nongaussian 
tails. The distribution of the error for the predictions shows 
a (7 = 6.7%. In the Table 3 there is a summary with error for 
training and predictions. The cases with very large discrep- 
ancy were studied in detail, and found to be characterized 
by a {p, T, H) value at the edges of the variables space. 

To determine the measure of the dispersion of the envi- 
ronmental variables considering all the runs {N) we com- 
puted the: 
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n 2 



(2) 
(3) 



The distribution of the error as a function of the 

dispersion of environmental variables (Fig. 5) shows 
three distinct structures. The satellite bands with very large 
error were studied in detail. All data point in such banda55 
belong to period four and gap six for which problems wera56 
detected. Period four and gap six therefore were excludedi57 
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Fig. 4. Error for training (top) and prediction (bottom) for all runs. 
Gaussian fit superimposed. The quantity a is the width of the gaus- 
sian fit to the data in a reduced range which excludes the nongaus- 
sian tails. 

in the analysis. The distribution of the error as a function 
of dispersion of environmental variables after this selection 
has a (7 ^ 4% width and nongaussian tails extending up to 
^ = 200%. 




fiducial boundary 



10 



AX/X 



Fir 



5. Distribution of as a function of the dispersion of environ- 



mental variables for all periods, six gaps and both training and 
prediction. Each training period is included once, each prediction is 
included 4 times, due to different training period chosen. 

A selection on the fiducial volume in the x variables space 
(Table 2) was applied in order to exclude from the anal- 
ysis data with {p, T, H) close to the edges of the variable 



158 space. After the selection cuts, prediction on two periodass 

159 based on training on the third period were performed. Thaag 

160 nongaussian (NG) tails were defined as the fractional areago 

161 outside the region ±4it. The selection cuts slightly reducagi 

162 the width (ct < 3.7%), while drastically reducing the non-192 

163 gaussian tails (Table 3). 193 

Table 2 

Synopsis of the selection cuts for fiducial volume applied to predictcdi^s 
data_^ 196 
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(958 <p< 968)mbar (19.4 < T < 20.4)°C (34 < if < 44)% 197 

198 

Tabic 3 199 
Summary of errors a and nongaussian (NG) tails for various selectionzoo 



Data sets 




NG tail 


202 
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203 


All six chambers, all four periods training 


2.7 


2.26 
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205 


All six chambers, all four periods prediction 


6.7 


6.60 


206 


Chamber six and period four excluded prediction 


3.0 


4.63 


207 


Predict, on per. 2 and 3, train, on per. 1 


4.0 


3.52 




Predict, on per. 3 and 1, train, on per. 2 


3.4 


2.95 
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Predict, on per. 1 and 2, train, on per. 3 


3.8 


1.63 




Predict, on per. 2 and 3, train, on per. 1, fiducial cuts 


3.7 


0.49 


209 
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Predict, on per. 3 and 1, train, on per. 2, fiducial cuts 


2.9 


0.98 
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Predict, on per. 1 and 2, train, on per. 3, fiducial cuts 


3.3 


0.29 


212 
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In this study the GGM is the system used to train th^^^ 

neural network with anode charge as output variable andsis 

{p, T, H) as input variable. The addition of the dark currcnt^jg 

as a output variable and dose as input variable is expected 

to improve predictions and will be implemented. The main^^j^ 

advantage of this approach is that several variables can be 

222 

used together in order to predict chamber behavior without 

the needs of studying the surface corrosion, environmen-^^^ 

224 

tal/radiation dependence and bakelite aging due to chcmi-^^j 
cal reactions and deposits; also in the ANN analysis, givcr^je 
enough data, it is possible to decouple the cffcict of the cho-227 
sen variables used as output. This approach, once properlj228 
trained, coiild spot immediately and online pathologicat29 
chambers whose behavior is shifting from the normal one.23o 
Further stiidies are in progress to determine and cure the3i 
residual nongaussian tails of the ^ errors distributions ^232 
to deal with training and prediction on detectors with dif-233 
ferent high voltage supply, to widen the sample of environ-234 
mental conditions, and in adding new dimensions to th^35 
variables space such as radiation levels. 236 



tained using data from the CMS RPC GGM system. The 
ANN was trained for predicting the behavior of the anode 
charge Q (output variables) as function of the environ- 
mental variables {p,T,H) (input variables), resulting in a 
prediction error — 4%. In a forthcoming work we plan 
to include the dose as input variable and the dark current 
as output variable, aiming at a further improvement on 
the predictions. 
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186 A new approach for modeling the RPC behavior, based 

187 on ANN, has been introduced and preliminary results ob- 
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